Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 2344823 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 3438 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 125.2 MiB |
| Average record size in memory | 56.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 5 |
| Dataset has 3438 (0.1%) duplicate rows | Duplicates |
IN_TREINEIRO is highly overall correlated with TP_FAIXA_ETARIA and 1 other fields | High correlation |
Q001 is highly overall correlated with Q002 and 1 other fields | High correlation |
Q002 is highly overall correlated with Q001 and 1 other fields | High correlation |
Q003 is highly overall correlated with Q001 | High correlation |
Q004 is highly overall correlated with Q002 | High correlation |
TP_ESCOLA is highly overall correlated with TP_ST_CONCLUSAO | High correlation |
TP_FAIXA_ETARIA is highly overall correlated with IN_TREINEIRO | High correlation |
TP_ST_CONCLUSAO is highly overall correlated with IN_TREINEIRO and 1 other fields | High correlation |
TP_COR_RACA has 40871 (1.7%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-15 02:47:13.733355 |
|---|---|
| Analysis finished | 2024-04-15 02:48:20.884816 |
| Duration | 1 minute and 7.15 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
TP_FAIXA_ETARIA
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.2380606 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 12 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.3427183 |
|---|---|
| Coefficient of variation (CV) | 0.78873774 |
| Kurtosis | 2.2625056 |
| Mean | 4.2380606 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.6837301 |
| Sum | 9937502 |
| Variance | 11.173766 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 593355 | |
| 2 | 576153 | |
| 4 | 269293 | |
| 1 | 247749 | |
| 5 | 151508 | 6.5% |
| 6 | 95792 | 4.1% |
| 11 | 88159 | 3.8% |
| 7 | 67515 | 2.9% |
| 8 | 49832 | 2.1% |
| 12 | 47083 | 2.0% |
| Other values (10) | 158384 | 6.8% |
| Value | Count | Frequency (%) |
| 1 | 247749 | |
| 2 | 576153 | |
| 3 | 593355 | |
| 4 | 269293 | |
| 5 | 151508 | 6.5% |
| 6 | 95792 | 4.1% |
| 7 | 67515 | 2.9% |
| 8 | 49832 | 2.1% |
| 9 | 36867 | 1.6% |
| 10 | 30176 | 1.3% |
| Value | Count | Frequency (%) |
| 20 | 315 | < 0.1% |
| 19 | 852 | < 0.1% |
| 18 | 2090 | 0.1% |
| 17 | 5149 | 0.2% |
| 16 | 9309 | 0.4% |
| 15 | 15339 | 0.7% |
| 14 | 23948 | 1.0% |
| 13 | 34339 | 1.5% |
| 12 | 47083 | |
| 11 | 88159 |
TP_COR_RACA
Real number (ℝ)
ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.991332 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 40871 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.0184765 |
|---|---|
| Coefficient of variation (CV) | 0.51145492 |
| Kurtosis | -1.3097986 |
| Mean | 1.991332 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.1328009 |
| Sum | 4669321 |
| Variance | 1.0372944 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1026418 | |
| 3 | 966698 | |
| 2 | 255863 | 10.9% |
| 4 | 43782 | 1.9% |
| 0 | 40871 | 1.7% |
| 5 | 11191 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 40871 | 1.7% |
| 1 | 1026418 | |
| 2 | 255863 | 10.9% |
| 3 | 966698 | |
| 4 | 43782 | 1.9% |
| 5 | 11191 | 0.5% |
| Value | Count | Frequency (%) |
| 5 | 11191 | 0.5% |
| 4 | 43782 | 1.9% |
| 3 | 966698 | |
| 2 | 255863 | 10.9% |
| 1 | 1026418 | |
| 0 | 40871 | 1.7% |
TP_ST_CONCLUSAO
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.9 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | |
| 4 | 6903 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 963119 | |
| 2 | 957731 | |
| 3 | 417070 | |
| 4 | 6903 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 963119 | |
| 2 | 957731 | |
| 3 | 417070 | |
| 4 | 6903 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 963119 | |
| 2 | 957731 | |
| 3 | 417070 | |
| 4 | 6903 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 963119 | |
| 2 | 957731 | |
| 3 | 417070 | |
| 4 | 6903 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 963119 | |
| 2 | 957731 | |
| 3 | 417070 | |
| 4 | 6903 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 963119 | |
| 2 | 957731 | |
| 3 | 417070 | |
| 4 | 6903 | 0.3% |
TP_ESCOLA
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.9 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 1387092 | |
| 2 | 760853 | |
| 3 | 196878 | 8.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 1387092 | |
| 2 | 760853 | |
| 3 | 196878 | 8.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1387092 | |
| 2 | 760853 | |
| 3 | 196878 | 8.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 1387092 | |
| 2 | 760853 | |
| 3 | 196878 | 8.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 1387092 | |
| 2 | 760853 | |
| 3 | 196878 | 8.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 1387092 | |
| 2 | 760853 | |
| 3 | 196878 | 8.4% |
IN_TREINEIRO
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.9 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1927753 | |
| 1 | 417070 | 17.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1927753 | |
| 1 | 417070 | 17.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1927753 | |
| 1 | 417070 | 17.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1927753 | |
| 1 | 417070 | 17.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1927753 | |
| 1 | 417070 | 17.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1927753 | |
| 1 | 417070 | 17.8% |
TP_LINGUA
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.9 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1357622 | |
| 1 | 987201 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1357622 | |
| 1 | 987201 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1357622 | |
| 1 | 987201 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1357622 | |
| 1 | 987201 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1357622 | |
| 1 | 987201 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1357622 | |
| 1 | 987201 |
Q001
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5935045 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 8 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.8760278 |
|---|---|
| Coefficient of variation (CV) | 0.40840883 |
| Kurtosis | -0.75607041 |
| Mean | 4.5935045 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.03745095 |
| Sum | 10770955 |
| Variance | 3.5194803 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 720913 | |
| 2 | 350284 | |
| 3 | 291859 | |
| 4 | 259673 | 11.1% |
| 6 | 250540 | 10.7% |
| 8 | 202637 | 8.6% |
| 7 | 193050 | 8.2% |
| 1 | 75867 | 3.2% |
| Value | Count | Frequency (%) |
| 1 | 75867 | 3.2% |
| 2 | 350284 | |
| 3 | 291859 | |
| 4 | 259673 | 11.1% |
| 5 | 720913 | |
| 6 | 250540 | 10.7% |
| 7 | 193050 | 8.2% |
| 8 | 202637 | 8.6% |
| Value | Count | Frequency (%) |
| 8 | 202637 | 8.6% |
| 7 | 193050 | 8.2% |
| 6 | 250540 | 10.7% |
| 5 | 720913 | |
| 4 | 259673 | 11.1% |
| 3 | 291859 | |
| 2 | 350284 | |
| 1 | 75867 | 3.2% |
Q002
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.8018234 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.6215216 |
|---|---|
| Coefficient of variation (CV) | 0.33768873 |
| Kurtosis | -0.43221959 |
| Mean | 4.8018234 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.32283434 |
| Sum | 11259426 |
| Variance | 2.6293324 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 851162 | |
| 6 | 329384 | 14.0% |
| 7 | 322603 | 13.8% |
| 4 | 262334 | 11.2% |
| 2 | 238683 | 10.2% |
| 3 | 232165 | 9.9% |
| 8 | 62486 | 2.7% |
| 1 | 46006 | 2.0% |
| Value | Count | Frequency (%) |
| 1 | 46006 | 2.0% |
| 2 | 238683 | 10.2% |
| 3 | 232165 | 9.9% |
| 4 | 262334 | 11.2% |
| 5 | 851162 | |
| 6 | 329384 | 14.0% |
| 7 | 322603 | 13.8% |
| 8 | 62486 | 2.7% |
| Value | Count | Frequency (%) |
| 8 | 62486 | 2.7% |
| 7 | 322603 | 13.8% |
| 6 | 329384 | 14.0% |
| 5 | 851162 | |
| 4 | 262334 | 11.2% |
| 3 | 232165 | 9.9% |
| 2 | 238683 | 10.2% |
| 1 | 46006 | 2.0% |
Q003
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2131082 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.5404451 |
|---|---|
| Coefficient of variation (CV) | 0.47942521 |
| Kurtosis | -0.86074084 |
| Mean | 3.2131082 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.24137501 |
| Sum | 7534170 |
| Variance | 2.372971 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 531108 | |
| 4 | 525019 | |
| 2 | 437711 | |
| 1 | 387595 | |
| 6 | 260803 | |
| 5 | 202587 | 8.6% |
| Value | Count | Frequency (%) |
| 1 | 387595 | |
| 2 | 437711 | |
| 3 | 531108 | |
| 4 | 525019 | |
| 5 | 202587 | 8.6% |
| 6 | 260803 |
| Value | Count | Frequency (%) |
| 6 | 260803 | |
| 5 | 202587 | 8.6% |
| 4 | 525019 | |
| 3 | 531108 | |
| 2 | 437711 | |
| 1 | 387595 |
Q004
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0045539 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4813366 |
|---|---|
| Coefficient of variation (CV) | 0.49303048 |
| Kurtosis | -0.81308274 |
| Mean | 3.0045539 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.48345606 |
| Sum | 7045147 |
| Variance | 2.1943582 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 902602 | |
| 4 | 645302 | |
| 1 | 309181 | 13.2% |
| 6 | 196040 | 8.4% |
| 5 | 149110 | 6.4% |
| 3 | 142588 | 6.1% |
| Value | Count | Frequency (%) |
| 1 | 309181 | 13.2% |
| 2 | 902602 | |
| 3 | 142588 | 6.1% |
| 4 | 645302 | |
| 5 | 149110 | 6.4% |
| 6 | 196040 | 8.4% |
| Value | Count | Frequency (%) |
| 6 | 196040 | 8.4% |
| 5 | 149110 | 6.4% |
| 4 | 645302 | |
| 3 | 142588 | 6.1% |
| 2 | 902602 | |
| 1 | 309181 | 13.2% |
Q006
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.0370689 |
| Minimum | 1 |
|---|---|
| Maximum | 17 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 14 |
| Maximum | 17 |
| Range | 16 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.7950422 |
|---|---|
| Coefficient of variation (CV) | 0.75342273 |
| Kurtosis | 1.4457602 |
| Mean | 5.0370689 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.4291988 |
| Sum | 11811035 |
| Variance | 14.402345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 630492 | |
| 3 | 369704 | |
| 4 | 276804 | |
| 5 | 194527 | 8.3% |
| 8 | 145866 | 6.2% |
| 7 | 145812 | 6.2% |
| 1 | 119268 | 5.1% |
| 6 | 115203 | 4.9% |
| 9 | 62344 | 2.7% |
| 10 | 44115 | 1.9% |
| Other values (7) | 240688 | 10.3% |
| Value | Count | Frequency (%) |
| 1 | 119268 | 5.1% |
| 2 | 630492 | |
| 3 | 369704 | |
| 4 | 276804 | |
| 5 | 194527 | 8.3% |
| 6 | 115203 | 4.9% |
| 7 | 145812 | 6.2% |
| 8 | 145866 | 6.2% |
| 9 | 62344 | 2.7% |
| 10 | 44115 | 1.9% |
| Value | Count | Frequency (%) |
| 17 | 39060 | 1.7% |
| 16 | 29282 | 1.2% |
| 15 | 31826 | 1.4% |
| 14 | 28365 | 1.2% |
| 13 | 39300 | 1.7% |
| 12 | 41407 | 1.8% |
| 11 | 31448 | 1.3% |
| 10 | 44115 | 1.9% |
| 9 | 62344 | |
| 8 | 145866 |
Q024
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.9 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 988546 | |
| 2 | 932055 | |
| 3 | 424222 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 988546 | |
| 2 | 932055 | |
| 3 | 424222 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 988546 | |
| 2 | 932055 | |
| 3 | 424222 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 988546 | |
| 2 | 932055 | |
| 3 | 424222 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 988546 | |
| 2 | 932055 | |
| 3 | 424222 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2344823 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 988546 | |
| 2 | 932055 | |
| 3 | 424222 |
NU_NOTA_MEDIA
Real number (ℝ)
| Distinct | 50309 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 543.48471 |
| Minimum | 0 |
|---|---|
| Maximum | 855.98 |
| Zeros | 22 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 402.08 |
| Q1 | 484.52 |
| median | 540.54 |
| Q3 | 602.06 |
| 95-th percentile | 693.14 |
| Maximum | 855.98 |
| Range | 855.98 |
| Interquartile range (IQR) | 117.54 |
Descriptive statistics
| Standard deviation | 88.04023 |
|---|---|
| Coefficient of variation (CV) | 0.1619921 |
| Kurtosis | -0.016444533 |
| Mean | 543.48471 |
| Median Absolute Deviation (MAD) | 58.58 |
| Skewness | 0.034281316 |
| Sum | 1.2743754 × 109 |
| Variance | 7751.0822 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 526.72 | 261 | < 0.1% |
| 557.2 | 259 | < 0.1% |
| 534.74 | 255 | < 0.1% |
| 531.5 | 254 | < 0.1% |
| 530.78 | 253 | < 0.1% |
| 513.4 | 251 | < 0.1% |
| 514.72 | 249 | < 0.1% |
| 529.5 | 249 | < 0.1% |
| 512.76 | 247 | < 0.1% |
| 530.6 | 246 | < 0.1% |
| Other values (50299) | 2342299 |
| Value | Count | Frequency (%) |
| 0 | 22 | |
| 56.14 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 66.1 | 1 | < 0.1% |
| 69.8 | 1 | < 0.1% |
| 72.12 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 82.28 | 1 | < 0.1% |
| 89.12 | 1 | < 0.1% |
| 92 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 855.98 | 1 | |
| 855.82 | 1 | |
| 851.84 | 1 | |
| 849.86 | 1 | |
| 848.32 | 1 | |
| 843.5 | 1 | |
| 842.02 | 1 | |
| 841.98 | 1 | |
| 841.76 | 1 | |
| 841.1 | 1 |
| IN_TREINEIRO | NU_NOTA_MEDIA | Q001 | Q002 | Q003 | Q004 | Q006 | Q024 | TP_COR_RACA | TP_ESCOLA | TP_FAIXA_ETARIA | TP_LINGUA | TP_ST_CONCLUSAO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| IN_TREINEIRO | 1.000 | 0.012 | 0.116 | 0.160 | 0.087 | 0.113 | 0.159 | 0.125 | -0.068 | 0.386 | -0.582 | 0.109 | 1.000 |
| NU_NOTA_MEDIA | 0.012 | 1.000 | 0.256 | 0.315 | 0.248 | 0.278 | 0.463 | 0.306 | -0.228 | 0.192 | -0.073 | 0.277 | 0.064 |
| Q001 | 0.116 | 0.256 | 1.000 | 0.501 | 0.502 | 0.357 | 0.375 | 0.350 | -0.160 | 0.204 | -0.218 | 0.260 | 0.124 |
| Q002 | 0.160 | 0.315 | 0.501 | 1.000 | 0.349 | 0.519 | 0.466 | 0.325 | -0.177 | 0.191 | -0.283 | 0.236 | 0.134 |
| Q003 | 0.087 | 0.248 | 0.502 | 0.349 | 1.000 | 0.497 | 0.387 | 0.364 | -0.165 | 0.219 | -0.152 | 0.267 | 0.103 |
| Q004 | 0.113 | 0.278 | 0.357 | 0.519 | 0.497 | 1.000 | 0.464 | 0.347 | -0.183 | 0.201 | -0.194 | 0.249 | 0.107 |
| Q006 | 0.159 | 0.463 | 0.375 | 0.466 | 0.387 | 0.464 | 1.000 | 0.485 | -0.290 | 0.252 | -0.217 | 0.299 | 0.111 |
| Q024 | 0.125 | 0.306 | 0.350 | 0.325 | 0.364 | 0.347 | 0.485 | 1.000 | -0.258 | 0.200 | -0.133 | 0.277 | 0.094 |
| TP_COR_RACA | -0.068 | -0.228 | -0.160 | -0.177 | -0.165 | -0.183 | -0.290 | -0.258 | 1.000 | 0.111 | 0.110 | 0.180 | 0.066 |
| TP_ESCOLA | 0.386 | 0.192 | 0.204 | 0.191 | 0.219 | 0.201 | 0.252 | 0.200 | 0.111 | 1.000 | -0.321 | 0.134 | 0.707 |
| TP_FAIXA_ETARIA | -0.582 | -0.073 | -0.218 | -0.283 | -0.152 | -0.194 | -0.217 | -0.133 | 0.110 | -0.321 | 1.000 | 0.177 | 0.498 |
| TP_LINGUA | 0.109 | 0.277 | 0.260 | 0.236 | 0.267 | 0.249 | 0.299 | 0.277 | 0.180 | 0.134 | 0.177 | 1.000 | 0.136 |
| TP_ST_CONCLUSAO | 1.000 | 0.064 | 0.124 | 0.134 | 0.103 | 0.107 | 0.111 | 0.094 | 0.066 | 0.707 | 0.498 | 0.136 | 1.000 |
| TP_FAIXA_ETARIA | TP_COR_RACA | TP_ST_CONCLUSAO | TP_ESCOLA | IN_TREINEIRO | TP_LINGUA | Q001 | Q002 | Q003 | Q004 | Q006 | Q024 | NU_NOTA_MEDIA | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5 | 2 | 1 | 1 | 0 | 1 | 5 | 6 | 1 | 4 | 2 | 1 | 558.24 |
| 1 | 6 | 3 | 1 | 1 | 0 | 1 | 3 | 1 | 1 | 2 | 1 | 2 | 394.62 |
| 2 | 6 | 2 | 1 | 1 | 0 | 1 | 5 | 5 | 2 | 2 | 2 | 1 | 414.10 |
| 3 | 4 | 3 | 1 | 1 | 0 | 1 | 5 | 5 | 2 | 2 | 2 | 1 | 438.10 |
| 4 | 2 | 1 | 2 | 3 | 0 | 1 | 5 | 5 | 2 | 1 | 2 | 1 | 576.70 |
| 5 | 2 | 3 | 3 | 1 | 1 | 0 | 7 | 6 | 6 | 6 | 2 | 2 | 530.58 |
| 6 | 8 | 2 | 1 | 1 | 0 | 1 | 2 | 6 | 1 | 4 | 2 | 1 | 645.80 |
| 7 | 1 | 3 | 3 | 1 | 1 | 0 | 8 | 5 | 3 | 2 | 2 | 2 | 378.74 |
| 8 | 4 | 1 | 1 | 1 | 0 | 1 | 2 | 4 | 4 | 2 | 2 | 2 | 500.40 |
| 9 | 4 | 3 | 1 | 1 | 0 | 1 | 5 | 5 | 2 | 2 | 5 | 1 | 605.58 |
| TP_FAIXA_ETARIA | TP_COR_RACA | TP_ST_CONCLUSAO | TP_ESCOLA | IN_TREINEIRO | TP_LINGUA | Q001 | Q002 | Q003 | Q004 | Q006 | Q024 | NU_NOTA_MEDIA | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2344813 | 12 | 3 | 1 | 1 | 0 | 0 | 3 | 4 | 6 | 2 | 7 | 3 | 599.36 |
| 2344814 | 2 | 1 | 2 | 2 | 0 | 0 | 4 | 5 | 3 | 3 | 6 | 2 | 526.38 |
| 2344815 | 6 | 0 | 1 | 1 | 0 | 0 | 2 | 5 | 1 | 1 | 4 | 1 | 533.66 |
| 2344816 | 3 | 2 | 1 | 1 | 0 | 0 | 5 | 5 | 6 | 2 | 2 | 2 | 467.20 |
| 2344817 | 3 | 1 | 2 | 2 | 0 | 0 | 5 | 6 | 3 | 4 | 6 | 2 | 515.02 |
| 2344818 | 12 | 1 | 1 | 1 | 0 | 1 | 5 | 5 | 3 | 3 | 3 | 1 | 488.40 |
| 2344819 | 11 | 2 | 1 | 1 | 0 | 1 | 8 | 3 | 6 | 3 | 4 | 2 | 617.92 |
| 2344820 | 2 | 3 | 2 | 2 | 0 | 0 | 8 | 5 | 3 | 2 | 2 | 1 | 541.22 |
| 2344821 | 11 | 1 | 1 | 1 | 0 | 0 | 3 | 2 | 2 | 2 | 2 | 1 | 507.22 |
| 2344822 | 2 | 1 | 2 | 2 | 0 | 0 | 5 | 3 | 3 | 2 | 7 | 2 | 607.06 |
Most frequently occurring
| TP_FAIXA_ETARIA | TP_COR_RACA | TP_ST_CONCLUSAO | TP_ESCOLA | IN_TREINEIRO | TP_LINGUA | Q001 | Q002 | Q003 | Q004 | Q006 | Q024 | NU_NOTA_MEDIA | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2360 | 3 | 3 | 2 | 2 | 0 | 1 | 2 | 2 | 1 | 1 | 2 | 1 | 447.90 | 4 |
| 78 | 1 | 1 | 3 | 1 | 1 | 0 | 6 | 6 | 4 | 4 | 7 | 3 | 620.10 | 3 |
| 108 | 1 | 1 | 3 | 1 | 1 | 0 | 6 | 6 | 5 | 5 | 17 | 3 | 643.98 | 3 |
| 281 | 1 | 1 | 3 | 1 | 1 | 0 | 7 | 7 | 5 | 5 | 16 | 3 | 635.92 | 3 |
| 395 | 1 | 1 | 3 | 1 | 1 | 0 | 7 | 7 | 5 | 5 | 17 | 3 | 633.72 | 3 |
| 400 | 1 | 1 | 3 | 1 | 1 | 0 | 7 | 7 | 5 | 5 | 17 | 3 | 636.74 | 3 |
| 412 | 1 | 1 | 3 | 1 | 1 | 0 | 7 | 7 | 5 | 5 | 17 | 3 | 645.70 | 3 |
| 428 | 1 | 1 | 3 | 1 | 1 | 0 | 7 | 7 | 5 | 5 | 17 | 3 | 653.16 | 3 |
| 431 | 1 | 1 | 3 | 1 | 1 | 0 | 7 | 7 | 5 | 5 | 17 | 3 | 655.16 | 3 |
| 441 | 1 | 1 | 3 | 1 | 1 | 0 | 7 | 7 | 5 | 5 | 17 | 3 | 661.10 | 3 |